Obstacle Avoidance through Reinforcement Learning

نویسندگان

  • Tony J. Prescott
  • John E. W. Mayhew
چکیده

A method is described for generating plan-like, reflexive, obstacle avoidance behaviour in a mobile robot. The experiments reported here use a simulated vehicle with a primitive range sensor. Avoidance behaviour is encoded as a set of continuous functions of the perceptual input space. These functions are stored using CMACs and trained by a variant of Barto and Sutton's adaptive critic algorithm. As the vehicle explores its surroundings it adapts its responses to sensory stimuli so as to minimise the negative reinforcement arising from collisions. Strategies for local navigation are therefore acquired in an explicitly goal-driven fashion. The resulting trajectories form elegant collisionfree paths through the environment

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

Self-learning Fuzzy Navigation of Mobile Vehicle

This paper describes a self-learning navigation method which utilizes fuzzy logic and reinforcement learning for navigation of a mobile vehicle in uncertain environments. The proposed navigator consists of three modules: Obstacle Avoidance, Move to Goal and Fuzzy Behavior Supervisor. The fuzzy rules of the on-line obstacle avoidance are learnt through reinforcement learning. A new and powerful ...

متن کامل

Improving Reinforcement Learning of an Obstacle Avoidance Behavior with Forbidden Sequences of Actions

This paper is concerned with the improvement of reinforcement learning through the use of forbidden sequences of actions. A given reinforcement function can generate multiple effective behaviors. Each behavior is effective only considering the cumulative reward over time. It may not be the behavior expected by the designer. In this case, the usual solution is to modified the reinforcement funct...

متن کامل

Towards Monocular Vision based Obstacle Avoidance through Deep Reinforcement Learning

Obstacle avoidance is a fundamental requirement for autonomous robots which operate in, and interact with, the real world. When perception is limited to monocular vision avoiding collision becomes significantly more challenging due to the lack of 3D information. Conventional path planners for obstacle avoidance require tuning a number of parameters and do not have the ability to directly benefi...

متن کامل

A reinforcement learning based neural network architecture for obstacle avoidance in multi-fingered grasp synthesis

The ability to learn from interaction with the exterior world as well as variability are two main features of living organisms. The aim of this study is to present and discuss the property of a stochastic reinforcement learning-based model of upper limb posture generation that exhibits both properties. The capacity of the model to discover suitable postures satisfying task and obstacle avoidanc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1991